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Abstract We show that, given a word equation over a finitely gener¬ 
ated free group, the set of all solutions in reduced words forms an EDTOL 
language. In particular, it is an indexed language in the sense of Aho. 

The question of whether a description of solution sets in reduced words 
as an indexed language is possible has been been open for some years 
ini ng, apparently without much hope that a positive answer could hold. 
Nevertheless, our answer goes far beyond: they are EDTOL, which is a 
proper subclass of indexed languages. We can additionally handle the 
existential theory of equations with rational constraints in free products 
*i<i < s Fi, where each F, is either a free or finite group, or a free monoid 
with involution. In all cases the result is the same: the set of all solutions 
in reduced words is EDTOL. This was known only for quadratic word 
equations by [5], which is a very restricted case. Our general result be¬ 
came possible due to the recent recompression technique of Jez. In this 
paper we use a new method to integrate solutions of linear Diophantine 
equations into the process and obtain more general results than in the re¬ 
lated paper [5]. For example, we improve the complexity from quadratic 
nondeterministic space in [5] to quasi-linear nondeterministic space here. 

This implies an improved complexity for deciding the existential theory 
of non-abelian free groups: NSPACE(nlogn). The conjectured complex¬ 
ity is NP, however, we believe that our results are optimal with respect 
to space complexity, independent of the conjectured NP. 

Introduction and main results 

The first algorithmic description of all solutions to a given equation over a free 
group is due to Razborov [HI IB]- His description became known as a Makanin- 
Razborov diagram. This concept plays a major role in the positive solution of 
Tarski’s conjectures about the elementary theory in free groups mm- 

It was however unknown that there is an amazingly simple formal language 
description for the set of all solutions of an equation over free groups in reduced 
words: they are EDTOL. An EDTOL language L is given by a nondeterministic 
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finite automaton (NFA), where transitions are labeled by endomorphisms in a 
free monoid which contains a symbol ff. Such an NFA defines a rational language 
1Z of endomorphisms, and the condition on L is that L = {h{ff) | h G 7Z}. The 
NFA we need for our result can be computed effectively in nondeterministic 
quasi-linear space, i.e., by some NSPACE(nlogn) algorithm. As a consequence, 
the automaton has singly exponential size 2 c ’( nlogn ) in the input size n. 

A description of solution sets as EDTOL languages was known before only 
for quadratic word equations by [5]; the recent paper jjjh did not aim at giving 
such a structural result. There is also a description of all solutions for a word 
equation by Plandowski in mi- His description is given by some graph which 
can computed in singly exponential time, but without the aim to give any formal 
language characterization. Plandowski claimed in m that his method applies 
also to free groups with rational constraints, but he found a gap [15] . 

The technical results are as follows. Let F(A+) be the free group over a finite 
generating set A+ of (positive) letters. We let A± = A + U {a -1 | a G A+} C 
F(A + ). We view A± as a finite alphabet (of constants ) with the involution a = 
a -1 . The involution is extended to the free monoid A* j_ by aq • • ■ a& = a/T • ■ ■ 7T[. 
We let 7 r : A* j_ —>• F(A+) be the canonical morphism. As a set, we identify F(A+) 
with the rational (i.e., regular) subset of reduced words inside A±. A word is 
reduced if it does not contain any factor aa where a G A±. Thus, w G A± 
is reduced if and only if n(w) = w. We emphasize that F(A + ) is realized as 
a subset of A±. Let 17 be a set of variables with involution. An equation over 
F(A + ) is given as a pair (U, V ), where U, V G (A±U17)* are words over constants 
and variables. A solution of (U, V) is a mapping a : 17 —> A± which respects the 
involution such that na(U) = n ct(V') holds in F(A + ). As usual, a is extended 
to a morphism o : (A± U 17)* —> A± by leaving constants invariant. Throughout 
we let # denote a special symbol, whose main purpose is to encode a tuple of 
words (wi ,... ,Wk) as a single word w\#- ■ ■ ■ #Wk- 

Theorem 1. Let (U,V) be an equation over F(A + ) and {Xi,...,Xfc} be any 
specified subset of variables. Then the solution set Sol(/7, V) is EDTOL where 
Sol([/, V) = {a(Xi)ff ■ ■ ■ ffa(Xk) \ u solves (U, V) in reduced words}. 

Moreover, there is a nondeterministic algorithm which takes (U, V) as input 
and computes an NFA A such that Sol(f7, V) = \ € L(A)} in quasi- 

linear space. 

The statement of Theorem |T| shifts the perspective on how to solve equations. 
Instead of solving an equation, we focus on an effective construction of some 
NFA producing the EDTOL set. Once the NFA is constructed, the existence of 
a solution, or whether the number of solutions is zero, finite or infinite, become 
graph properties of the NFA. 

Theorem [T] is a special case of a more general result involving the existential 
theory with rational constraints over free products. The generalization is done 
in several directions. First, we can replace F(A+) by any finitely generated free 
product F = *i <i< s Fi where each F t is either a free or finite group, or a free 
monoid with arbitrary involutions (including the identity). Thus, for example 
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we may have F = {a, 6}* * Z * PSL(2, Z) = {a, 6}* * Z * (Z/3Z) * (Z/2Z) where 
a = a and 6 = 6. Second, we allow arbitrary rational constraints. We consider 
Boolean formulae where each atomic formula is either an equation or a rational 
constraint, written as A' £ L, where L C F is a rational subset. 

Allowing rational constraints makes it necessary to specify how the input for 
a constraint is given. We do so algebraically, by using a morphism p : A* —>• N, 
where N is a finite monoid with involution and A = A -1 C F generates F. 
Thus, we write a constraint in the form X £ p~ l (m), with m £ N, and the 
interpretation pa(X) = m. The input size ||4>|| is given by the sum of the lengths 
of all atomic formulae, together with (|A| + |/2|)(1 + log \N\). The specification 
that the solution is in reduced words increases the input size by at most a factor 
ofO(log|A|). 

Theorem 2. Let ¥ be a free product as above, T> a Boolean formula over equa¬ 
tions and rational constraints, and {Xi,...,Xk} any subset of variables. Then 
Sol(^) = {<r( X\)ff ■ ■ ■ #cr(Xk) | cr solves d> in reduced words} is EDTOL. 

Moreover, there is an algorithm which takes T> as input and produces an NFA 
A such that Sol(^) = {q>(if) \ £ L(A)}. The algorithm is nondeterministic 

and uses quasi-linear space in the input size ||^||. 

The proof of Theorem |T] is given in Section [21 Part II of the paper is devoted 
to the proof of Theorem [2] which is more technical and more difficult. 

1 Preliminaries 

We assume that the reader is familiar with big-0 and big-<9 notation. A function 
/ : N —>• R is called quasi-linear if we have |/(n)| £ 0(n log n). For results and 
notation in complexity theory we refer to the textbook [T3]. We also use standard 
notation from combinatorics on words and automata theory according to [7]. 

1.1 Words and involutions 

If A is a set then A* denotes the free monoid over A. An element of A is called 
letter and an element of A* is called word. The length of word w is denoted by 
|u>|, and |u>| Q counts how often a letter a appears in w. 

If M is any monoid and u, v £ M , then we write u < v if u is a factor of v, 
which means we can factorize v = xuy for some x, y £ M. We denote the neutral 
element in M by 1. In particular, 1 denotes also the empty word. 

An involution of a set A is a mapping x <—> x such that x = x for all a; £ A. 
For example, the identity map is an involution. A morphism between sets with 
involution is a mapping respecting the involution. A monoid with involution has 
to additionally satisfy xy = y x. A morphism between monoids with involution 
is a homomorphism ip : M —> N such that <p(x) = g>(x). It is an S-morphism if 
<p(x) = x for all x £ S C M. All groups are monoids with involution given by 
x = a: -1 , and all group-homomorphisms are morphisms. Any involution on a set 
A extends to A*: for a word w = ai ■ ■ • a m we let w = oIH ■ • • of. If a = a for all 
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a £ A then w is simply the word w read from right-to-left. The monoid A* is 
called a free monoid with involution. 


1.2 NFAs, rational and recognizable subsets in monoids 

Let us recall the notion of recognizable and rational set and let us emphasize that 
this concerns incomparable families, in general. See [7] for more background. The 
term regular will be used only in the context of finitely generated free monoids. 

Let M be any monoid. A subset L £ M is called recognizable if there is a 
homomorphism if : M —> N to some finite monoid N such that L = ^ _1 ('0(F)). 
We also say that N or if recognizes L. Recognizability is a “saturation property”: 
we have w £ L if and only if ip(w) £ 'if(L). 

The family of rational subsets RAT(M) is defined inductively as follows. All 
finite subsets of M are rational. If L, L' C M are rational, then the union FUF', 
the concatenation L ■ F', and F + are rational. Define F° = {1} and L l+1 = F • L 1 
for i £ N. Then L + = (J {F* | i > 0} denotes the subsemigroup of M which 
is generated by the subset L C M. We let L* = L + U {1}; then L* is the 
corresponding submonoid. 

For a finitely generated free monoid A* the family RAT (A*) coincides with 
the family of recognizable subsets: this is the content of Kleene’s classical the¬ 
orem (see any standard textbook on formal languages, such as IT]). In general, 
however, the two families are incomparable. For example, for a group G the two 
families coincides if and only if G is finite. 

By definition, if h : M —> M' is a homomorphism, then L i—> h(L) induces 
a mapping RAT(M) —> RAT(M'). The mapping is surjective if and only if the 
homomorphism h is surjective. In the following let M be finitely generated. Con¬ 
sider any surjective homomorphism 7r : T* —> M where F is finite. Then every 
L £ RAT(M) can be specified by some K £ RAT(F*) such that 7r (K) = L. 
The family RAT(F*) coincides with the family of “regular” subsets of F*. Reg¬ 
ular subsets of F* are those which can be accepted by a non-deterministic finite 
automaton, i.e. an NFA. Again, we can define the notion of NFA for arbitrary 
monoids M: an NFA A over M is a tuple A = ( Q , M, 6,1, F ) where Q is a set 
of states , I C Q is the set of initial states, F C Q is the set of final states, and 
S C. Q x M x Q is a finite set of transitions. If (p, m, q) is a transition then 
to £ M is called its label and each path 

(p 0 ,m 1 ,p 1 ),{p 1 ,m 2 ,P 2 ),-- ■, (pk-i,m k ,p k ) 

labels a monoid element toi • • • nik £ M. For states p,q £ Q we let L(A,p , q) C 
M be the set of labels of paths from p to q. Thus, the accepted language L(A) 
of an NFA A is 

L(A) = (J {L(A,p, q) \ p £ I Aq £ F} . 

Moreover, if a regular set K C F* is accepted by some NFA A with state 
set Q = {1,..., n}, then we can choose for a recognizing homomorphism N the 
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monoid of Boolean n x n matrices M nxn . Indeed, for each letter a £ T define a 
matrix p(a) £ B nxn by 



( 1 ) 


Cleary, due to © we have for all w £ T* the equivalence w £ L(A) C r* 
p(w) £ p(L(A)) CB nxn . 


We say that a finite monoid (with involution) N has an efficient representa¬ 
tion if we can specify each element m £ N by 0(log \N\) bits and if all basic op¬ 
erations, like equality checking, (computing the involution), and multiplication, 
are in space C?(log |iV|), too. For example, B" x ™ has an efficient representation 
(if the involution is the transposition). 

In the following, all finite monoids used to define rational constraints are 
assumed to have an efficient representation. Throughout we use the following 
fact: if ipi : M —>• N\ recognizes L\ C M and tfiz '■ M —>• N? recognizes L 2 C M, 
then ifi x if >2 ■ M —>• N\ x N 2 recognizes every Boolean combination of L 1 and L 2 . 
Thus, whenever we add a new constraint by another recognizing homomorphism 
we switch to a larger monoid. There is, however, no size explosion because we 
use finite monoids with an efficient representation and log \Ni x A^| = log |W| + 
log | IV 2 1 • 

1.3 EDTOL systems 

The notion of EDTOL system refers to Extended, D eterministic, Table, 0 inter¬ 
action, and Tindenmayer. There is a vast literature on Lindenmayer systems, 
see m, with various acronyms such as DOL, DTOL, ETOL, etc. The subclass 
EDTOL is equal to HDTOL (see e.g. [201 Thm. 2.6]), and has received particular 
attention. We use very little L-theory: essentially we content ourselves to define 
EDTOL through a characterization (using rational control) due to Asveld |2]. 
The class of EDTOL languages is a proper subclass of indexed languages in the 
sense of [X], see [6]. For more background we refer to [20]. 

Definition 1. Let A be an alphabet and L C A* be a subset. We say that L is 
EDTOL if there is an alphabet C with A C C, a finite set H C End(C*) of 
endomorphisms of C, a rational language R C H*, and a symbol ff £ C such 
that L = (<p(#) | (p £ 1?} . 

Note that for a subset R C H* of endomorphisms of C* we have {^(#) | <p £ R} 
is a subset of C*. Our definition implies that R must guarantee that tp(#) £ A* 
for all ip £ R. The language C is called an extended alphabet. 

Example 1. Let A = {a, 6} and C = {a, b , #, $}. We let H be set of four endo¬ 
morphisms f,g a ,gb,h satisfing /(#) = $$, g„($) = $a, g&($) = $6, and h( S) = 1, 
and on all other letters the f,g a ,gb,h behave like the identity. Consider the ra¬ 
tional language R C H* defined by R = h{g a ,gb}* f (where endomorphisms 
are applied right-to-left). A simple inspection shows that {</?(#) | ip £ R} = 
{vv | v £ A*}, which is not context-free. 
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1.4 Triangulation 


We can replace an equation over any monoid M by a system of triangular equa¬ 
tions (i.e., equations X = V where \V\ = 2) and one additional special equation 
Y = 1, where Y is a fresh variable. The procedure is straightforward: consider an 
equation U = V where U, V £ (AUfi)* are words, A C M, and ft = {Xi... X^} 
is a set of (free) variables. Clearly, 

VXi ,...,X k :(U = V <=>• EX, Y : X = UYY A X = VYY A Y = 1). 

With the exception Y = 1, both equations have the form X = W with \W\ > 2. 
If \W\ > 2 then W = yzW' and we obtain 

\/X, Y,Xi,... ,X k : X = W <=> EX' : (X' = yz) A (X = X'W'). 

Using this we can transform any system of equations into a new system with 
additional variables; and we end up with a system of equations of type X = yz 
and one (singular) equation Y = 1. 

2 Proof of Theorem |T] 

2.1 Preprocessing. 

According to Section fim we start with a system of triangular of equations: there 
is one special equation Y = 1 and all other equations have the form X = V 
where X is a variable and \V\ = 2. Next, we add the special symbol # to the 
alphabet A±; and we define A = A± U {#}. We let # = #; this will be the 
only self-involuting letter in the proof of Theorem [T] We must make sure that no 
solution uses # and every solution is in reduced words. We do so by introducing 
a finite monoid N with involution which plays the role of (a specific) rational 
constraint. 

Let N = { 1,0}UA± x A± have multiplication 1-x = x-1 = x, 0-ie = £-0 = 0, 
and 

(a,6)-(c,d) = {^ d ) lf ^ = ^ (2) 

The monoid N has an involution by 1 = 1, 0 = 0, and (o,6) = ( b,a ). Consider 
the morphism yo '■ A* —> N given by yo (#) = 0 and /xo(a) = (a, a) for a £ A±. 
It is clear that yo respects the involution and yo(w) = 0 if and only if either w 
contains # or w is not reduced. If, on the other hand, 1 ^ w £ A± is reduced, 
then yo{w) = (a, b ), where a is the first and b the last letter of w. Also, yo(w) = 1 
if and only if w = 1. Thus, if a is a solution in reduced words, then for each 
variable X there exists some 0 ^ y{X) = y{X) £ N with y(X) = yocr(X). 

The morphism y allows us to remove the special equation Y = 1. It is replaced 
by fixing y(Y) = 1. 

For the other variables there are only finitely many choices for y , so we assume 
that each equation is specified together with a morphism y : ft —> N. We now 
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require that a solution a : Q —> A* satisfy three properties: na(U) = n rcr(V'), 
<j{X) = cr(X), and /. i(X ) = fiQfj(X) for all X £ 17. 

The next step allows us to work with free monoids with involution rather than 
with groups. This relies on Lemma [U which is well-known, too. Its geometric 
interpretation is simply that the Cayley graph of a free group (over standard 
generators) is a tree. See Figure [1] for a visual explanation. □ 
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Vx,y,z 3P,Q, R : tv(x) = n(yz) -£=£■ tt(x) = PR A ir(y) = PQ A tt(z) = QR. (3) 

Figurel. Part of the Cayley graph of F(A + ) with standard generators: the root 
is 1 on the top. The geodesics to vertices x and y split after an initial path 
labeled by P. 


Lemma 1. Let x, y, z be reduced words in A±. Then x = yz in the group F(A+) 
if and only if there are reduced words P, Q , R in A± such that x = PR, y = PQ, 
and z = QR holds in the free monoid A±. 

Proof. If there are words P, Q, R in A*. j_ such that x = PR, y = PQ, and z = QR 
holds in the free monoid A* j_ then we have tt(x) = tt (yz), hence x = yz in the 
group F(A+). This is trivial and holds whether or not x, y,z are reduced. For 
the other direction, let x,y,z be reduced words in A* j_ such that tt(x) = tt (yz). 
If the word yz is reduced then we choose P = y, Q = 1, and R = z. In the other 
case we have y = y'a and z = az' for some a £ A± and we can use induction. □ 

The consequence of Lemma [T] is that with the help of fresh variables P,Q,R 
we can substitute every equation x = yz with x, y, z £ A± U P by the following 
three word equations to be solved over a free monoid with involution. 

x = PR, y = PQ, z = QR. (4) 

The enlarged set of variables becomes 17 U {P, P, Q, Q, R, P}. Assume a solves 
© such that a(x), cr(y), a(z) are reduced. Let <r(P) = p, a(Q) = q, and a(R) = 
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r, Then p , q , and r are reduced and there is no cancellation in any of the words 
pr , pq , and qr. The lack of cancellation is encoded by p, but note that there are 
various choices for /i(P), p(Q), p(R)' for example, let p(x) = ( b , c), /t(y) = (6, a), 
and /ify) = (a, c) and assume that p,q,r # 1. Then there are a last letter d in 
p and first letters e in q and f in r such that d # e # / # d, so the choice 
/i(P) = (6, d), /t(Q) = (e, a), and /i(I?) = (/, c) is correct. 


2.2 The initial equation 

By Lemma [T] it is enough to prove the analogue of Theorem [T| for free monoids 
with involution, systems of equations (Lfyfy), 1 < * < s and a morphism 
/iinit : (A U 17)* —> N such that 0 # /iinit (Ui) = /iinit(Ui) for all i. Finally, 
we construct a single equation (U',V’) over A U 17, where U' = Pi# • • • #t/ s 
and V' = Vi#---#y s . Notice that ^ init (X) # 0 and |Pi| # = \Vi\# = 0 for 
all i. A solution cr : 17 —>- must satisfy cr(P / ) = o-(fo'), o-(X) = a( A'), and 

dinit(-A) = Ho&(X) for all left. The set of variables {Xi, ..., A#}, specified in 
Theorem [1] is a subset of the new and larger set of variables 17, and the original 
solution set became a finite union of solution sets with respect to different choices 
for po and /iinit- The result holds since EDTOL is closed under finite union. In 
order to achieve our result we have to protect each variable X t as follows. We 
assume A± U 17 = {aq,..., x{\ with aq = Xi for 1 < i < k where {Xi ..., 
is the specified subset in the statement of Theorem Q] 

The word Wi nit over (A U 17)* is then defined as: 

Wi„it = #*!#... #^#P'#W#tF#F 7 #^# ... #aq#. (5) 


Observe that Wmit is longer than (but still linear in) |A| + |17| + \UV\. The 
number of #’s in Winit is odd; and if cr : [A U 17)* —» A* is a morphism with 
<t(X) reduced for all X £ 17, then: n< t(U) = na(V) cr(P , ) = cr(W) 

ct(Winit) = cr(Wnit)- Here (U,V) is the equation in Theorem Q] and {U',V') is 
the intermediate word equation over A U 17. Therefore Theorem [T] follows by 
showing that the following language is EDTOL: 


jer(Ai)#- • -#cr(X fc ) cr(Wi„it) = cr(W in it) A /t in it = p-ocr A VX : a(X) = crfyY)| 


2.3 Partial commutation and extended equations. 

Partial commutation is an important concept in our proof. It pops up where 
traditionally the unary case (solving a linear Diophantine equation) is used as 
a black box, as is done in [5]. At first glance it might seem like an unneccesary 
complication, but in fact the contrary holds. Using partial commutation allows 
us to encode all solutions completely in the edges of a graph, which we can 
construct in quasi-linear space, and is one of the major differences to [5]. As 
a (less important) side effect, results on linear Diophantine equations come for 
free as this is the special case F(A + ) = Z: solving linear Diophantine equations 
becomes part of a more general process. 







We fix n = rij n it = |Winit| and some k £ N large enough, say k = 100. We 
let C be an alphabet with involution (of constants) such that |C| = nn and 
ACC. We define £ = C U f2 and assume that ff is the only self-involuting 
symbol of £. In the following x, y, z,... refer to words in £* and X, Y, Z, . . . 
to variables in fi. Throughout we let B , B' and X, X' denote subsets which are 
closed under involution and satisfy X' C X C Cl and either ACBCB'CCox 
A C B' C B C C. In particular, B and B' are always comparable. 

We encode a partial commutation as follows. Let c C B and either p = c 
or p = cc. (The case p = cc is needed only later in the proof of Theorem [5J) 
Let 9 C (X U B + ) x {p,p} denote an irreflexive and antisymmetric relation. 
It is called a type if 9 satisfies the following conditions: first, ( x,y) C 9 implies 
(x, y) C 9 and x £ XU {a, a, aa | a £ B}\{c, c, cc}, and second, for each x we have 
|0(:r)| < 1, where 9{x) = {y £ B* | ( x , y) £ 6}. The type relation 9 can be stored 
in quasi-linear space. Given 6 and p : B U X —> N such that p(xy) = p(yx) 
for all ( x , y) £ 0, we define the following two partially commutative monoids 
with involution: first, M(B,X } 9,p) = (BU X)*/ {xy = yx \ (x,y) £ 9} - a 
monoid with a morphism p : AI(B 1 X,9,p) —> TV - and second, M(B,9,p) = 
B */ {xy = yx \ (x, y) £ 9}. If w < W £ M(B, X , 9 , p) then w is called a proper 
factor if w ^ 1 and = 0. 

Since the defining relations for M(B, X , 9 , p) are of the form xy = xy , we can 
define \W\ and \W\ a for W £ M(B,X,9,p) by representing W by some word 
If £ (BU X)*. Typically we represent w, W by words w, W £ (B U X)*, but 
their interpretation is always in M{B , X , 9 , p). 

Definition 2. We call W £ M(B,X,9,p) well-formed if \W\ < nil. |W| # = 
|Winit|#; every proper factor x of W and every x £ B UX satisfies p(x) y 0, and 
B* (~l p~ x { 1) = {1}. Moreover, if x is a proper factor then x is a proper factor, 
too. Finally, for every a £ A± there is a factor ffaff in W. 

Definition 3. An extended equation is a tuple V = ( W,B,X,9,p ) where W £ 
M(B,X, 9, p) is well-formed. 

A U-solution of V is a B-morphism cr : AI(B,X,9,p) —> AI(B,9,p) such 
that a(W) = a(W) and cr(X) £ y* whenever ( X , y) £ 9. 

A solution of V is a pair (a, a) such that a : M{B,9,p) —» A* is an A- 
morphism satisfying poa = p and a is a B-solution. 


W = equation, where the solution is a “palindrome” a(W) = a(W) £ A*. 

B = alphabet of constants with ffcACB = BCC. 

X = variables appearing in W. Hence, X = X C Q. 
p = morphism to control the constraint that the solution is reduced. 

9 = partial commutation. 

During the process of finding a solution, we change these parameters, and we 
describe the process in terms of a diagram (directed graph) of states and arcs 
between them. 
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2.4 The directed labeled graph Q. 

We are now ready to define the directed labeled graph Q which will be the core 
of the NFA defining the EDTOL language Sol (U, V) we are aiming for. 

Define the vertex set for Q to be the set of all extended equations V = 
(IT, B, X , 0, /t). The initial vertices are of the form (IT n it, A, 17,0, /Xi n it). Due 
to the different possible choices for /ij n it there are exponentially many initial 
vertices. In fact, Theorem |T] requires us to work in exponentially many graphs 
simultaneously because of the different possible /to : A* —> N. However, to 
simplify the presentation we fix one /To. 

We define the set of final vertices by {(IT, B, 0,0, /t) | W = W} . By defi¬ 
nition every final vertex trivially has a B-solution a = id#. (Note that in a 
final vertex there are no variables.) The arcs in Q are labeled and are of the 

form (l V,B,X,9,fi) (W', B', X', 0', //). Here h : C* —> C* is an endo¬ 

morphism which is given by a morphism h : B' —» B* such that h induces a 
well-defined morphism h : M(B' U X',9',n') —> M(B U X,9,n). Note that the 
direction of the morphism is opposite to the direction of the arc. There are fur¬ 
ther restrictions on arcs. For example, we will have \h(b')\ < 3 for all b'. The 

main idea is as follows. Suppose (IT, B, X, 9 , /t) (IT', B', X' , 0', //) is an arc, 

a : M(B,0,fj) — > M(H,0,/xo) is an H-morphism, and (IT', B', X\ 9', fj,') has a 
B'-solution <r'\ then there exists a solution (a, a) of the vertex (IT, B, X, 9, fi). 
Moreover, for the other direction if (a, a) solves V = (W, B, X,9, fi) and V is 
not final then we can follow an outgoing arc and recover (a, a) from a solution 
at the target node. We will make this more precise below. 

2.5 Compression arcs. 

These arcs transform the sets of constants. Let V = (W,B,X,9,n) and V' = 
(IT', B', X, 9 ', fi') be vertices in Q. The compression arcs have the form V ——> V', 
where either h = e is defined by the identity on C* , or h is defined by a mapping 
c h(c) 7 ^ c with c £ B'. Recall that if a morphism h is defined by h(c) = u for 
some letter c then, by our convention, h(c) = u and h(x) = x for all x £ S which 
are different from c and c. We assume 0 ^ //(c) = /t(/i(c)) / 1 and fi{x) = fi'(x) 
for all x £ (B n B') U X (if not explicitly stated otherwise). 

We define compression arcs ( h(W'),B,X,6,n ) (IT', B', X, 9', fi 1 ) of the 

following types, only. 

1. (Renaming.) We assume that h is defined by h(c) = a such that B£ B' = 
B U {c,c}, and 9 C O'. Thus, possibly, 0^-9'. 

2. (Compression.) We have h(c) = u with |u| < 3 and either B = B' and 
9' = 9 or B £ B' = B U {c,c} and 9 = 9' = 0. 

3. (Alphabet reduction.) We have B' £ B, 9' = 0, and h is induced by the 
inclusion B'CB. Hence we have h = e = idc*- 

For the proof of Theorem |T] we compress words u into a single letter c only if 
| it | < 2. For Theorem [2] we will have in addition the case where u = aac with 
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either a = c (and a = c) or (aa, cc) £ 9. The purpose of arcs of type 3 is to 
remove letters in B that do not appear in the word W. This allows us to reduce 
the size of B and to “kill” partial commutation. 

Lemma 2. Let (W, B , X , 9, p) — (W', B ', A", 0', /./') 6 e an arc of type 1,2 or 3 
with W = h(W'). Let a : M(B , 9 , /.z) —»• M(A, 0, /zo) &e an A-morphism at vertex 
V = (h(W'),B,X,9, p) and a' be a B’-solution to V’ = (W, B', X',0', p'). 
Define a B-morphism a : M(B,X,9,p) —» M(B,0,p ) by a(X) = ha'(X). Then 
(a, a) is a solution at V and (ah, a') is a solution at V' and aa(W) = aha'(W'). 

Proof. By definition, ph = p! and poa = p. Hence (ah, a') is a solution at V. 
Now, h(X) = X for all X £ X. Hence, a(h(X)) = cr(X) = ha'(X). For b' £ B' 
we obtain ah(b') = h(b') = ha'(b') since a 1 and a are the identity on B' and B 
respectively. It follows ah = ha' and aa(W) = aha'(W'). Next, 

a(W) = a(h(W')) = h(a'(W ')) = h(a'(W)) = a(h(W)) = a(hfW 7 )) = a(W). 

Thus, a is a H-solution to V and, consequently, (a, a) solves V. □ 

2.6 Substitution arcs. 

These arcs transform variables. Let V = {W, B , X, 9, p) and V' = (W', B, X',9', p') 
be vertices in Q and X £ X. We assume that X = X' u{X, X} and p(x) = p'(x), 
as well as 9(x) = 0'(x) for all x £ (B U X) \ {X, X}. The set of constants is 
the same on both sides, but X' might have fewer variables. Substitution arcs are 
defined by a morphism r : {X} —>• BX U {1} such that we obtain a .B-morphism 
r : M(B,X,9,p) —> M(B, X',9 ', p'). We let e = iclc* as before. We define 
substitution arcs (W, B,X,6,p) —A (t(W), B, X',9', p') if one of the following 
conditions apply. 

4. (Removing a variable.) Let X' = X \ {X, X}. The B-morphism r : 
M(B,X,6,p) —>■ M(B,X’,0',p') is defined by r(X) = 1. 

5. (Variable typing.) The purpose of this arc is to introduce some type for 
variables without changing anything else, so X' = X and p' = p. Sup¬ 
pose that 9(X ) = 0 and p £ B+ is a word with p(Xp) = p(pX) and 
such that 9' = 9 U {(X,p), (X,p)}. The B-morphism r : M(B,X,9,p) —> 
M (B, X, 9 ', p) is defined by the identity on B U X. Note that the condition 
p(Xp) = p(pX) implies that if p : M(B,X,9,p) —>• N is well-defined, then 
p : M(B , X, 9 ', p) —N is well-defined, too. The other direction is trivial. 

6 . (Substitution of a variable.) We have (B, X , 9) = (B ', X ', 9'). Let p £ B + 
such that 0(X) C {p}. (For 9(X) = 0 this is always true.) We suppose that 
we have p(X) = p(p)p'(X) (hence, automatically p(X) = p'(X)p(p)) and 
that t(X) = pX defines a morphism r : M(B, X , 9, p) —> XI (B, X , 9, p'). 

Lemma 3. Let V = (W,B,X,9,p) —^ (W ', B, X',9 ', p') = V' with e = idc* 
be an arc of type 4,5,6 with W' = t(W). Let a : M(B,9,p) —> M(A,$, pfi] be 
an A-morphism at vertex V and a' be a B-solution to V'. Define a B-morphism 
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a : M(B, X,9, /z) —> M(B,9, /T) by cr(X) = ct't(X). Then (a, a) is a solution at 
V and (a, a') is a solution at V'. Moreover, aa(W) = aha'(W') where h = e is 
viewed as the identity on id m(b,8,h)- 

Proof. Since a ' is a B-solution to V' we have a(W) = (j'{t(W)) = <j'{t{W)) = 
<t't(W) = a(W). Hence, (a, a) is a solution at V. Since M(B , 9 , /z) = M(B, 9', /i') 
(a possible change in /i or 0 concerns variables, only), (a, a') = is a solution at 
V'. The assertion aa(W) = aha'(W') is trivial since W' = t(W), (j = a'r, and 
h = s induces the identity on M(B , 9 , n). □ 

Proposition 1. Let Vo -4- Vi ■ ■ ■ V t be a path in Q of length t, where 
Vo = (bVmit, A, Q, 0, /Zi„it) is an initial and Vt = iW’, B , 0,0, /z) zs a final vertex. 
Then Vo has a solution (id^jCr) with criW^t) = hi ■ ■ ■ ht(W'). Moreover, we 
have W’ £ #z«i# • • • ffukffB* such that \uf\^ = 0 and we can write: 

hi--- h t (u iff ■ ■ ■ #u fc ) = ct(Xl)# • • • ffa(X k ), (6) 

Proof. By definition of final vertices we have W' = W' and no variables occur 
in W'. Hence, ids* defines the (unique) _B-solution of W’. By definition of the 
arcs, h = h\ -■ - h t : M(B, 0, /z) — > A* = M(A, 0, /z init ) is an A-morphism which 
shows that (/i,ids*) solves W'. There is only one A-morphism at Vo, namely 
idyi». Using Lemma [5] and Lemma [3] we see first, Vo has some solution (idA'jCr) 
and second, 


kb4*cr(Winit) = id A *hi ■ --htidB*{W') = hi ■ --h t (W'). (7) 

Finally, for 1 < j < t we have hj(ff) = # and \hj(x)\# = 0 for all other symbols. 
Hence the claim hi ■ ■ ■ h t {u\ff ■ ■ ■ #zz/c) = a(X-■ ■ ■ ffa(Xk). □ 

2.7 Construction of the NFA A. in quasilinear space 

The input to Theorem [T| is given by three items: A ±, f], and the pair of words 
U, V £ (A± U 17)*. The input size in bits is in 

®({\UV\ + \A± U Q\) log(| A± U 17|)). 

Our non-deterministic procedure will use space 0((\UV\ + \A± U 17|) log(|t/U| + 
| A± U 17|)), which is quasi-linear in the input size. 

Let no = {\UV\ + \A± U 17|) log(|A± U 17Q) be the input size. The definition 
of n = riinit = |Winit| gives n £ O(no), and therefore we can also choose n = n; n it 
as the new input size. In the preprocessing we transformed the original input 
into the word Wi n it over a larger set of variables. However, transforming the 
initial data into Wi n it is easily implementable in quasi-linear space. 

The next observation is that, according to Definition [2j every well-formed 
word W can be stored with O(nlogn) bits. Moreover, given W £ M{B,X,6,pf), 
it can be checked (deterministically) in space 0(n\ogn) whether it is well- 
formed. The same is true for extended equations according to Definition [31 
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Next, we can consider in space O(nlogn), one after another, all candidates 

(IT, B , X, 9 , /x) (IT', B 1 , X, 6 ', n') for arcs in Q. (We use that, by construc¬ 

tion, h can be encoded by a list of length 0(n) for all arcs in Q.) Each time we 
check (deterministically) in space 0(n\ogn) whether there is indeed an arc h. If 
the answer is positive, we output the arc. The switch from the graph Q to the 
NFA A is computationally easy: for every final vertex (W', B, 0, 0, p) compute a 
prefix #w/# of W with \w'\# = k and output the arc (IT', B, 0, 0, /x) -^4 #. 

We used nondeterminism during the procedure, but this was not really nec¬ 
essary, as we could have produced an NFA satisfying Theorem |T] in deterministic 
quasi-linear space. However, such an NFA would contain a lot of unnecessary 
states and arcs. 

In order to justify the last sentence in the abstract that deciding the ex¬ 
istential theory of non-abelian free groups is in NSPACE(n logn), (i.e., non- 
deterministic quasi-linear space) we need to check whether L(A) 0. Thus 
we integrate this check into the construction of the NFA A right-away. So we 
modify our construction: instead of computing all vertices and all arcs of Q 
we output vertices and arcs only if they belong to a path from an initial to 
a final vertex. More precisely, for each vertex V we define a Boolean variable 
Useful(V) which is 1 if V is on a path from an initial to a final vertex and 
0 otherwise. In order to compute Useful(V) we guess such a path in nonde- 
terministic space 0(n\ogn). Here we use the standard fact that the nondeter- 
ministic complexity class NSPACE(nlogn) is closed under complementation by 
Immerman-Szelepcsenyi (1987), see I13| . Thus, our procedure outputs a ver¬ 
tex V (resp. an arc V = ( W, B, X,9, p) (IT', B ', X,9 ', p') = V') only 

if Useful(T) = 1 (resp. Useful(T) = Useful(T') = 1). If no vertex satisfies 
Useful(T) = 1, then we can output a one-state NFA without final states be¬ 
cause then Winit has no solution. If however, at least one vertex V satisfies 
Useful(T) = 1, then the output is an NFA A which accepts a nonempty set of 
endomorphisms over C and, according to our construction, ITi n it has at least one 
solution; moreover, the reader is invited to show that ITinit has infinitely many 
solutions if and only if L(A) is infinite. 


2.8 Forward property of arcs 

The previous section has shown how to produce the graph Q, which is now at 
our disposal. For every initial vertex Vq with a given solution (id^*, cto), we need 

to establish the existence of a path Vq —4 Ti • • • -4* V t to some final vertex 
V t = (IT', B } 0, 0, fj) such that the following equation holds 

a 0 (W) = h 1 ---h t (W'). (8) 

This relies on the following technical concept. 

Definition 4. Let V = (IT, B , X , 9 , /x) -4* (IT', B', X’ , 9', /j/) = V' be an arc in 
Q and (a, a) be a solution at V. We say that the tuple (V — V', a, a) satisfies 
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the forward property if there exists a solution ( ah, a') at V' such that 

aa{W) = aha'(W'). 

Lemma 4. Let V = (W,B,X,9,p) (■ t(W), B, X',9', p') = V' be a substi¬ 

tution arc as in 4 or 6 and (a, a) be a solution at V. Suppose that a(X) = uv 

and t(X) = uX. Then (V ——> V',a,a) satisfies the forward property. 

Proof. If we let cr'(X) = v and p'(X) = p(v) then we can write er = red 
and the morphism er : M(B,X,9,p) —» M(B,9,p) factorizes through o' : 
M(B, X',9',p!) ->■ M(B,6,n). □ 

Lemma 5. Let V = (W,B,X,9,p) —>■ (t(W), B, X,0', p) = V' be a variable¬ 
typing arc as in 5 and (a, a) be a solution at V such that cr(X) £ c*. (Thus, 
we have p(Xc) = p{cX) £ p(c*); and the arc with the new type (X, c) £ 9 1 is 

defined.) Then (V —V',a,a) satisfies the forward property. 

Proof. The morphism er : M(B,X,6,p) —> M(B,0,p) factorizes canonically 
through a’ : M(B , X, 6', p) —> M(B , 6, p). □ 

Lemma 6. Let V = ( h(W ), B , X, 9, p) (W, B ’, X, 9p') = V' be any com¬ 
pression arc as in 1.2 or 3 and (a, a) be a solution at V. Suppose there exists a 
B' -solution a' at V' such that a : X —>• M ( B , 9, p) factorizes through morphisms 
as follows 

a-.X-^r M{B9', p') M(B, 9, p). 

Then (ah, a') is a solution at V' with aa(W) = aha'(W'). In particular, (V — 
V', a, a) satisfies the forward property. 

Proof. Trivial. □ 

It is clear that Lemma [G] does not suffice for our purpose. We cannot prevent 
er from using letters from B which are not present in B', but then no factorization 

er : X M(B',9', p!) —M(B,9,p) exists if h is induced by the identity, as 
in the case of alphabet reduction. As already mentioned in the main body of the 
text, we need this type of alphabet reduction only over empty type relations. 
We content ourselves with the following statement. 

Lemma 7. Let V = ( W,B,X,$,p) (W, B', X, 0, p!) = V' be an alphabet 

reduction as in 3, where B' Cf. B and p' is the restriction of p. Let (a, a) be a 
solution at V. Define a B'-morphism (3 : M(B,$,p) —> M(B' ,0, p') by /3(b) = 
a(b) for b £ B \ B' and define cr'(X) = (3a(X ). Then (ah, a’) is a solution at 

V 1 with aa(W) = aea'(W’). In particular, (fo — V',a,a) satisfies the forward 
property. 
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Proof. Since a : —>• M(A, 0, Ho) is a morphism, we have /x/3(&) = 

/j,a(b) = ho cn(b) = /x(6) for all b £ B \ B' and ft is indeed a morphism from 
M(B,0,/x) to M(R',0, M '). 

Note that M(B', X, 0, /x') is a submonoid of M(B , T, 0, /x) and e realizes the 
inclusion of these free monoids. Hence W = e(W') = IV' as words. In particular, 
a(W) = er(lV) implies er'(IV') = a'(W'). Thus, (as, a') solves V'. 

Finally, by definition of ft we have a = aft because a is an H-morphism. 
Hence a = aeft and we obtain 

aecr'(IV') = aea'(W) = aefta(W) = aa(W). 


□ 


2.9 Compression. 

This section finishes the proof of Theorem [L] Consider an initial vertex Vo = 
(Winit, A, 17, 0, Hinit) with a solution (a, a). We will show below that Q contains 
a path Vo -^4 V\ ■ ■ ■ V t to some final vertex V t = (W', B , 0, 0, p) such that 
c(IVinit) = hi--- h t (W'), and so Q contains all solutions to Wi n i t . Let us show 
why then, indeed, we are almost done with Theorem [lj We augment the graph 
Q by one more vertex which is just the symbol #. Recall that {Xi ,..., A'*,} has 
been the set of specified variables. Every final vertex (IV', B, 0, 0, fi) has a unique 
factorization IV' = ffw'ffw" with \w'\^ = k. Let us add arcs (IV', B, 0, 0, /x) -^4 
ff where g w ’ : C* —> C* is the homomorphism (not necessarily respecting the 
involution) defined by g w >(#) = w'. If we define the NFA A as Q with this 
augmentation and if we let ff be the exclusive final vertex, then by Proposition!!] 
we obtain Theorem [l] The construction of A can easily be implemented by a 
nondeterministic procedure that uses 0(n\ogn) space. In order to show the 
existence of the path from Vo to Vt we apply the recompression methocj^ of 
[5], but with a new and improved treatment of “block compression”. We avoid 
solving linear Diophantine equations, and give a structural theorem involving 
EDTOL languages that is more precise than the result in [5], 

We show the existence of the path corresponding to the solution (a, a) using 
an alternation between “block compression” and “pair compression”, repeated 
until we reach a final vertex. The procedures use knowledge of the solution being 
aimed for. We proceed along arcs in Q of the form V = (IV, B , X , 9, /x) V' = 
(IV', B', X', 9', /x') thereby transforming a solution (a, a) to V into a solution 
(a',cr') to V'. However, this is not allowed to be arbitrary: we must keep the 
invariant ckt(IV) = a'ha'(W’). For example, consider the alphabet reduction 
where B' B and IV = IV' G (£?' U X)*. In this case we have h = idc*, 
which induces the inclusion e : M(R', 0, /x') —» AI(B,9,h). If a does not use 
letters outside B' there is no obstacle. In the other case, fortunately, we will 
need alphabet reduction only when the type relation is empty on both sides. 
Then we can define ft(b) = a(b) G A* for b G B \ B' and ft(b) = b for b G B'. 

1 Compression became a main tool for solving word equations thanks to EEj. 
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We let a'(X ) = /3a(X). This defines a B'-solution at V'. In some sense this is a 
huge “decompression” making <r'{W) perhaps much longer than a(W). However, 
{ae,a') is a solution to V'. 

A word in w £ A* is a sequence of positions, say 1,2, ...,|w|, and each 
position is labeled by a letter from S. If W = uqXiUi ■ • ■ XmUm, with Ui £ C* 
and Xi £ fi, then <r(W) = Uo<j(xi)ui ■ ■ ■ a(x m )u m and the positions in a(W) 
corresponding to the iq’s are henceforth called visible. 


Block compression Let V = (W, B, X, 0, p) be some current non-final vertex 
with an empty type relation and a solution (a, a). We start a block compression 
only if B < \W\ < 29 n. Since \C\ = lOOn, there will be sufficiently many “fresh” 
letters in C\B at our disposal. 

1. Follow arcs of type 4 and 6 to remove all variables with |cr(X)| < 2. Thus, 
without restriction, we have |cr(A)| > 2 for all X. If V became final, we 
are done and we stop. Otherwise, for each X we have u(X) = bw for some 
b £ B and w £ B + . Following a substitution arc of type 6. we replace X 
by bX. (Of course, we also replace X by Xb, changing n{X) to p{X) = 
p{X) = p(w). From now on we always do this without further comment.) 
Every substitution X *->■ bX decreases Ylxex \ a<J {X)\, a fact which will be 
used later. Moreover, if bX < W and b'X < W are factors with b, b 1 £ B , 
then # ^ b = b 1 due to the previous substitution X i->- bX. For each 
b £ B\{#} define sets A/, C N which contain those A > 2 such that there is 
an occurrence of a factor db x e in a(W) with d ^ b ^ e, where at least one of 
the V s is visible. We also let = {X £ X \ bX < W A a(X) £ bB*}. Note 
that J2 b \A^\ + \Xb\ < \W\. Since W is well-formed we have At, = A b . 

2. Fix some subset B + C B such that for each # ^ b £ B we have b £ B + <S=> 
b 7 ^ B+. For each b £ B+, where Ab ^ 0, run the following b-compression: 

3. 6 -compression. (This step removes all proper factors b l and b , £ > 2, from 
W.) 

(a) Introduce fresh letters Cb,cE with p.(cb) = p(b). In addition, for each A £ 
Ab introduce fresh letters c\ : b, c\ t b with ^(cv.b) = p.(b). We abbreviate 
c = Cb, c = Cb, c\ = c\,b, and ca = ca> We let h(c\) = h{c ) = b 
and we introduce a type by letting 9 = {(ca,c) | A £ Ab}. Renaming 
arcs 1 realize this transformation. We did not touch W = h(W), but we 
introduced partial commutation. 

(b) When we introduced c, c\ we did not change W, but we changed the 
alphabet B to some larger alphabet B'. Now we change W and its solu¬ 
tion. We start to replace in cr(W) £ B* every factor db x e (resp. db e ), 
where d ^ b ^ e and A £ Ab, with dc x e (resp. dc x e). This yields a new 
word W' £ B '*, which was obtained via the renaming arc h(c) = b. Re¬ 
call that for every X £ Xb we had bX < W and for some positive b we 
had cr(X) = b e w with w bB*. In the new word W' we have cX < W' 
and for the new solution a’ we have a(X) = c e w' with w' ^ cB'*. We 
rename W', B', a' = ah, o' as W, B, a, a. 
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(c) 

(d) 


(f) 


(e) 


We define 0 = {(c*, c) | A G A {,} U {(X, c) | X G Xb A er(A') G c*}. This 
can be realized by arcs 5. 

Let W G M(B,X,9, fj.) be given by some word in W G (B U X)*: scan 
the word a(W) G B* from left to right. Stop at each factor dc x e with 
d ^ c ^ e and A G Ab- If in this factor some position of the c’s is visible 
then choose exactly one of these visible positions and replace that c by 
c\. If no c is visible, they are all inside some cr(A'); then choose any c 
and replace it by c^. Recall that c and c\ commute, hence dc x e became 
dc\c x ~ 1 e = dc? 1 c\c?*e G M(B,6,n) for all £\ + £2 = A — 1. After that 
we run through the same steps for c. The whole transformation can be 
realized by renaming arcs 1 defined by h(c\) = c. There is a crucial 
observation: if X G Xf, and we had a(X) = c^w with w (/L cB* before 
the transformation then now still a'{X) = c^w', but due to commuta¬ 
tion c\cr'{X) is a factor in <j'(W') G M(B,6,n). For example, assume 
Xc 2 YcZcXcYd < W with er(X) = cdc, cr(Y) = cdc , and cr(Z) = c 2 . 
Then the corresponding factor in W' looks as A'cJ cYceZcXcZYd = 
A'cl cY cqcZX aiYd G M(B, X, 9 1 n), but cq and Z do not commute. 
However, it is important only that <j'(Z) = c 2 and c§ commute, which 
they do. 

Rename W',B',a' = ah, a' as W, B,a,a. Perform the following loop 

G Xb with <t(X) G c* occurs in W. 


3(e)i 3(e)iv 


a = an, <r 
until no c and no A' 


i. If A G Xb and the maximal c-prefix of ct(A) is odd then follow an 
substitution arc A 1 —> cX. Do the same for b and c. 

ii. If in the new solution a'(W') there is a factor dcxc^e with £ odd, then 
inside this factor the word c\c is visible due to commutation and the 
previous step. In this case follow an arc 2 defined by h{c\) = cc\. 
Thus, w.l.o.g £ is even for all dc\c e e < 

iii. Follow a compression arc defined by h{c) = c 2 . 

iv. Remove all A with <t'(X) = 1 by following an substitution arc; and 
rename W', B', X', a! = ah, a' as W, B, X, a, a 

Let B' = B\ {c, c} and // be induced by /i. Observe that no core 
appears in cr{W)-. they are all compressed into single letters c\. Thus 
the type relation of B U X' is empty again. Hence we can follow an 
alphabet reduction arc (W, B,X,9, /z) (W, B', X', 0, \J). The new 

solution to {W, B', X', 0, //) is the pair {a', a) where a' = ae is defined 
by the restriction of a to M(B', 0, //). 


Having performed 6-compressions for all 6 G B + , we have increased the length of 
W. But it is not difficult to see that the total increase can be bounded in 0{n). 
Actually, we have \W\ < 3In at the end because we started with \W\ < 29 n 
and step 1 of block compression increases \W\ by at most 2 n, and no other step 
increases \ W\. Now we use alphabet reduction in a final step of block compression 
in order to reduce the alphabet B such that \B\ < \W\. We end up at a vertex 
named again V = (W, B, X, 0, fj), which has a solution (a, a). The difference to 
the situation before block compression is that now \B\ < \W\ < 31n, and no 
proper factor 6 2 , b G B, can be found in W anymore. 
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Space requirements for the block compression We start the block com¬ 
pression at a vertex V = (W, B, X,®, (i) with a given solution (a, a) only if 
\B\ < \W\ < 29 n. For example, every initial vertex having a solution (idA*,u) 
falls into that category. Now we recall all steps. 

First we removed variables X with |<r(A)| < 2, and for the remaining vari¬ 
ables we did some substitution bX, which together increases the length by 
at most 2 n. Hence we reached a vertex V' = (W', B, X' , 0, //) with \W'\ < 31 n 
via arcs satisfying the forward property by Lemma 0] Inspecting the procedure 
step by step, we verify whether each time we follow an arc if it satisfies the 
forward property using Lemmas 010 and [HI (but without using Lemma 0 during 
the block compression). So wherever we stop, Equation (0) is valid during this 
compression procedure if it was valid before. 

Next we have to show termination and also that we stay inside the graph 
Q during the procedures. Termination can be seen as follows. Renaming arcs 
are used at most 0(n) times, so they are irrelevant. No arc increases the sum 
y |a(A)|. Either it decreases this sum, or if this sums remains stable, then 
\W\ decreases. This shows termination. 

In order to show that we remain inside Q we have to control the possible 
fluctuations of \W\ and \B\. 

After step 1 we have \B\ < \W\ < 31n. Step 3(a) increases \B\ by introducing 
fresh letters Cb,Cb plus Ab. Since each such new letter corresponds to a visible 
letter in W, after this step we have \B\ < 62 n. No other steps increase \B\, and 
at the end we have \B\ < absW. 

Steps 2, 3(a)-(d) and 3(f) do not change the length of W. The only possible 
increase to \W\ can occur in step 3(e). Part 3(e)i may cause an increase of at 
most 2 n. Part 3(e)iii does not increase length, and part 3(e)iii decreases the 
length of c,c-blocks by half. We now show that at step 3(f) we have \W\ < 31 n 
again. 

Let O t = {cj,..., cj} be the set of letters core that are added to W in the 
t-th iteration of step 3(e)i, so r < 2 n. After step 3(e)i either 

1. there is a distinct letter cf G {c,c} for each cj G O t so that ejej < W, in 
which case the compression h(c) = cc in step 3(e)iii will replace by cf, 

2. there are factors in W of the form dc\ac\ x ... cj( e where d 1 e qL {c,c} and 
a € {c e ,c £ | 0 < e < s} (so each cj. does not have a distinct c,c letter 
matching it). In this case each c| came from replacing a variable Xj with 

a(Xj) = cj.. Some of the cj. letters will be consumed by steps 3(e)ii and 
iii, but if s — e > 2 then some will remain. However for each such letter we 
have one less variable in W, so the number of such letters that can possibly 
be carried to the next iteration of step 3(e) during the entire procedure is 
bounded by 2 n. 

So during each iteration of the loop, the number of letters increases in step 3(e)i 
by at most 2 n, and the number of c' letters that remain after an iteration is at 
most 2n, so \ W\ < 31n + 4n = 35n throughout step 3(e). At the end of step 3(e) 
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all c,c letters have been consumed, which includes the letters c' added in step 
3(e)i, so \W\ < 31 n. 

Hence we end the procedure with an increase in size which comes only from 
the first step: we end at some vertex V' = (W 7 , B', X, 0, p) where the type 
relation is empty again. Although B ' might be larger than B 1 we have \B'\ < 
\W'\ < 31 n. 


Pair compression After one round of block compression we run Jez’s procedure 
pair compression. It brings us back to |J3| < \W\ < 29 n and allows us to start 
another block compression. This is essential because it keeps the length in 0{n). 

Let us explain the procedure in detail. This is similar to El . but some of 
the technical details are different. We roughly follow [5], 

We start a pair compression at a vertex V p = (W, B, X, 0, p) with \B\ < 
\W\ < 31n, where W contains no proper factor b 1 2 for any b £ B. Thus we can 
assume to have just performed a block compression. Moreover, we assume that 
(a, a) is a solution to V. The goal is to end at a vertex V q = {W", B", X' , 0, p'") 
with \B"\ < \W"\ < 29n by some path satisfying the forward condition. 

We begin with a random partition B\ {#} = LUR such that b £ L •$=> b £ 
R which is constructed as follows. We first write B \{#} = B + U \b | b £ B + } as 
a disjoint union. This is possible because B \ {jf\ has no self-involuting letters. 
Next, for each b £ B + we choose uniformly and independently whether either 
b £ L and b £ R, or b £ R and b £ L. 

We have ab £ LR <S==> ba £ LR , hence the compression respects the 
involution, and there is no overlap in a(W) between any occurrences of ab £ LR 
and cd £ LR, unless it is the same occurrence and ab = cd. The idea is to 
compress in one phase all factors ab £ LR into a single fresh letter. But the 
obstacle is that in cr(W) an occurrence of some ab £ LR can be “crossing”. This 
means, there is some aX < W (resp. Xb < W) with cr(X) £ bB* (resp. a(X ) £ 
B*a). In this case, a(W ) has an occurrence of a factor ab where a is visible, but 
b is not visible. This forces us to “uncross” ab, which is a basic idea from El 
and appears in steps (2.) and (3.) below. 

1. Remove all X £ X with cr(X) = 1 via substitution arcs. 

2. Make B' large enough such that B C B' C C and for each ab £ LR there 
exists a uniquely defined “fresh” letter c a b £ B'\B, subject to the condition 
that first, ab < <j(W) and second, there is at least one occurrence of that 
factor such that either the position of a or b (or both) in a(W) is visible. 
Let us count how many fresh letters c a b we need. There are at most 31n 
positions in a(W) which are visible since |Wj < 31n. Moreover, each visible 
position leads to at most one factor ab £ LR. Thus, the number of c a b is less 
than 31n. Now, for each c a b we let c^b = c^-. Let us argue that c^- is present. 
This is clear if ab < W because in that case ba < W, too. Otherwise some a 
is visible and aX < W for a variable X. We obtain Xa < W, so the position 
of a is visible in a{W), and, hence, ba < a(W) creates the letter c^-. Thus, 

| B' \ B\ < 31n and therefore \B'\ < 62 n. 


19 


The following always holds: ab ^ ba. If we assume the contrary, then a 
factor ab = aa appears in er(lT) and suppose a is visible. We cannot have 
aa < W because in a well-formed word there are no proper factors w with 
p(w) = 0. Hence, (by symmetry) we must have aX < IT for some A' with 
cr(X) = aw' and, as a consequence, p(X) = p,(a)p,(w') = (a, a) ■ p(w’) £ N. 
Thus, /j,(aX) = 0, but aA' is a proper factor of IT, contradiction. Note the 
rather far reaching consequence of this last tiny computation: as ab ^ ba 
we can compress later unambiguously ab into c a b and ba into without 
creating any self-involuting letter. Thus, c a b ^ cVL and we maintain the 
invariant that no other symbol than # is self-involuting. 

We realize this alphabet enlargement, from B to B', via compression arcs 
labeled by h(c a b ) = ab. So far we have not changed IT. We simply followed 
compression arcs which satisfy the forward property. We are now at some 
vertex V = (W, B', A", 0, //) with the solution (ah, a). Indeed, aha(W) = 
aa(W) as a(W) £ B* C B'*. 

3. Create a list C = {X £ X’ \ 3b £ R : a(X) £ bB*}. For each X £ C do in 
any order: 

— if cr(X) £ bB* with b £ R then follow a substitution arc X t-+ bX. 
Remember, if we follow X e+ bX then automatically X is replaced with 
X b , too; and b £ L. We also have {AT, A'} C £ if and only if tr(A) £ bB*a 
for some ab £ LR. In that case we actually substituted X by bXa and X 
by aX b. In any case, we have successfully “uncrossed” every ab £ LR. We 
followed substitution arcs which satisfy the forward property by Lemma [TJ 
We are now at a vertex V' = (W', B', X', 0, p") with a solution (ah, a'). If 
ab < er'(IT') then in every occurrence of ab either both positions er'(IT') in 
are visible or neither are. This concludes the “uncrossing”. 

4. For each ab £ LR such that c a b was generated follow a compression arc 
labeled by h(c a b) = ab: we replace all occurrences of ab in er'(IT') by the letter 
Cab , and simultaneously replace all occurrences of ba in cr'(IT') by the letter 
c a b, with no overlapping ambiguity. It is therefore clear that the solution a' 
factorizes through h. Due to the previous “uncrossings”, all arcs satisfy the 
forward property. Note that h 2 = h because h : B' —» B' is a R-morphism 
and a,b £ B. Hence, we are now at a vertex V' = (IT", B', X', 0,//") with 
IT' = h(W") which has a solution (ah, a”) where cr" = her'. 

5. Finally, we perform an alphabet reduction as in 3 which replaces B' by some 
smaller alphabet B". As #a# < IT" (we never touched any occurrence of 
#a#) we see that automatically A C B". The alphabet reduction is done 
when the type relation is empty. Thus, we use Lemma [7] to see that we can 
realize this step by an arc which satisfies the forward property. We achieved 
our goal of reaching a vertex V q = (IT", B", X', 0, //") with a solution of the 
form (ah, Pa"). This finishes the procedure pair compression. 


The termination of the procedure is immediate. The maximal number of 
steps is bounded by 0(n). Suppose V p = (IT, B, X, 0, p) is the start vertex and 
V q = V" = (IT", B', X', 0, //") is the endpoint. Denote this path by 




P+1 


hp+2 h p+3 

- > Vp+2 - 1 
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Along the path we used only arcs satisfying the forward condition. Thus, starting 
with a solution (a, a) we find some solution (ah p+ 1 • • • h q , a") to V q such that 

aa(W) = ahp+i ■ ■ ■ h q <r"(W"). 

Thus we maintained the validity of Equation (jSJl throughout all iterations of 
block and pair compressions. 

What remains to be shown is that we can achieve \W"\ < 29 n for at least 
one partition B \ {#} = LU R. 

We reformulate the probabilistic argument of El in our setting. We started 
a pair compression at a vertex V p = {W, B, X, 0, y) with \W\ < 31 n. Let us 
factorize the word W G (B U X) + as W = xqU\Xi ■ ■ ■ u m x m such that 

1. Ui G (B \ {#}) + for 1 < i < to, i.e., each is a nonempty word over 
constants, 

2. The length of each \ui\ is divisible by 3, 

3. Xi G (B U X)* for 0 < i < m, 

4. \x 0 ---x m \< 3 n. 

This is possible because \W\^ + J2xex \W\ X < n ■ We need |a"o • ■ -a; m | < 3 n 
rather than n because we must adjust the lengths of the x^s in order to guarantee 
divisibility by 3 of the |iq|’s. By inserting factors of the form Xi = 1 we may 
assume: 


|it,: | = 3 for all 1 < i < to, (9) 

\w\ = \x 0 ■ ■ -x m \ + 3m. (10) 

Consider the word W' which was obtained by the substitution arcs, but before 
the compression of factors ab G LR into single letters. The increase in length 
\W'\ — \W\ is due to substitution arcs X H- bX,X Gift with A' G £, so the 
length goes up by 2 n. Note that the m factors do not change, only the factors. 
Hence, W' has the factorization W' = yoU\y\ ■ • ■ M m j/ m with y, G (B U X)* and 

\yo---y m \ < \xo---x m \ +2n. (11) 

Let W" be the word after pair compression has been performed. So \W"\ < 
|W ; |. If |W| < 27 n then 

\W"\ < \W'\ = | y 0 ...y m | + \ui. ..u m I < |a; 0 ■ ..x m \ + 2 n + \ui. ..u m \ 

= \xqui ... u m Xm\ + 2n = \W\ + 2n < 27 n + 2n = 29 n 
and we are done. 

Hence, let us assume 27 n < |Wj. We have 27 n < \W\ = jcco - - - Xm\ + 3to < 
3n + 3 to which means m > 8n. 

The word W" is the compression of a word y^viyi ■ ■ ■v m y m where each Vi 
is the result of the compression restricted to Ui. Each m can be written as 
Ui = abc with a,b,c G B. Since W did not contain any proper factor cP with 
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d £ B, (as we have performed block compression first) we know a ^ b ^ c. 
There are two possibilities: either b £ L or b £ R. In the first case, either 
c £ R or c £ L , and in the second case either a £ L or a £ R. Each event 
be £ LR^bc £ LL,ab £ LR,ab £ RR has probability so with probability 1 
Ui is compressed from length 3 to 2, and with probability 1 it remains length 
3. Thus, for the expectation we obtain E[\vi |] = | + | = |. By linearity of 
expectation, we obtain 


E[\vi ■ ■ -VmW = §m. (12) 

Since the expected length is |?n, and m is even, there must exist at least one 
partition B \ {#} = L U R which satisfies |ni • • • v m \ < |m. 

So, we “change our algorithm” and force the algorithm to choose exactly 
this partition (in other words, there is a path in the graph which chooses this 
partition, so we follow this one). We can thus guarantee |i>i • • • v m \ < |to by this 
choice. We may estimate the length of W" as follows. 


\W"\ < | y 0 ■■■y m | + \vi ■■■v m \ 

< \x 0 ■ ■ ■ x m \ + 2n + |m 

= |W| + 2n^^ since \W\ = |a;o • ■ • x m \ + 3m 

< \W\ — 2 n since m > 8 n 

< 29n since \W\ < 31 n. 

Due to \W"\ < 29 n 7 we can run another block compression, then a pair 
compression and so on. We alternate between block and pair compressions inside 
the graph Q 1 always following arcs satisfying the forward condition. 


Compression terminates Let s(0) = |Wi n it| and s(i) be the word obtained 
from s(i — 1) by a single of application of block then pair compression. 

A priori, although we have s(i) < 29 n for all i > 0 the compression method 
could run forever. Let us show that this can never happen. 

Lemma 8 (Termination). The alternation between block and pair compres¬ 
sion terminates with some final vertex. 

Proof. By contradiction assume the contrary. Then there exists an infinite path 
alternating between block and pair compressions satisfying the forward condi¬ 
tion. However, due to the first step in the block compression this infinite path 
uses infinitely many substitution arcs X t— >• bX , where b is a constant. As all arcs 
satisfy the forward condition for each substitution, each use of an arc X bX 
decreases J2 x ex l acr (^OI- No use of any arc increases this sum. Since we started 
with a fixed solution (id^*, do) at some initial vertex, there are no such infinite 
paths. This is a contradiction, and Theorem[T]is shown. □ 
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PART II. Proof of Theorem [2] 


This part contains some repetitions of what has been written above. We hope 
that these redundancies make the reading easier. More importantly, the set-up 
for Theorem[5]is more general, and this generates additional technical arguments. 
The proof of Theorem [2] is therefore more technical and more difficult than that 
of Theorem [I] because we allow more general constraints, and we have to cope 
with the elements of order 2 which appear in the free products - as for example 
in the modular group. 

3 Preliminaries for the proof of Theorem [2] 

3.1 Free products: special features of F 

Our results hold for finitely generated free products 

IF = *i<i< P Fi 

where each A* satisfies one of the following conditions: 

— Fi = F(A) is a free group with basis A- 

— Fi = A* is a free monoid with involution over a set A with involution. 
(Recall that the involution on A: might be the identity. Hence, if A* is the 
free monoid, then the involution means reading words from right-to-left.) 

— Fi is a finite group. (Every group is viewed as a monoid with involution by 
defining x = x _1 .) 

The monoid F is a monoid with an involution that is induced by the involutions 
on each A- It is not essential that the finite monoids A are groups, but it 
simplifies the presentation as there are fewer cases. By 17(F) we denote the 
submonoid of units , that is, the invertible elements in F. Thus 17(F) is the free 
product over those 7): which are groups. To make our results nonvacuous we 
assume that F is infinite. Also, we assume that the multiplication table for each 
finite group A is part of the input. 

Given F, we choose as a set of monoid generators the smallest subset Af Q F 
which is closed under involution and which contains the sets A (if A = F(A) 
or Fi = A*) and all sets A \ {1} where A is finite. We let n : Af —» F be the 
canonical morphism. 

If F is a group, then F is a finitely generated free product of infinite cyclic and 
finite groups. These groups are also known as plain groups or as basic groups. 
Basic groups form a proper subclass of the class of virtually free groups which 
are those groups having a free subgroup of finite index@ A geodesic triangle in 
the Cayley graph of a plain group, with respect to the standard generators, is 
depicted in Figure [2] 

2 For example, the modular group PSL(2,Z) = Z/2Z * Z/3Z is plain. The group 
SL(2, Z) is isomorphic to the amalgamated product Z/4Z* z / 2 z Z/6Z. Hence, it is 
virtually free, but it is not plain because it is infinite and has a non-trivial center. 
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A word w £ Ap is called reduced if it is the shortest word representing the 
element n(w) £ F. (These words are also called geodesics in the literature.) A 
word w is reduced (resp. geodesic) if and only if for each factor ab < w with 
a, b £ Ap we have first, if a € F(A*) then b ^ a -1 and second, if a £ Fi and if Fi 
is finite then b Fi. 

We identify F C dj with the regular set of reduced words in Ap. Note that 
every word in a free monoid A* is reduced, so factors of the form aa may appear in 
reduced words. Moreover, if Fi is finite, then there might be elements x ^ 1 = x 2 . 
Hence, the equation X 2 = 1 may have a non-trivial solution. These are the 
main reasons why the proof of Theorem [2] is more involved than the proof of 
Theorem [T| 

The monoid for recognizing the regular subset of reduced words is almost 
the same finite monoid as defined in (O) . We replace ([21) by the following monoid 
TVp = (1, 0} U Ap x Af, where 1 • x = x ■ 1 = x, 0 • x = ir • 0 = 0, and 


(a, b) • (c, d) 


0 if be is reduced 
(a, d) otherwise 


(13) 


The morphism ipp : Ap —> Np defined by ipw{a) = (a, a) for a £ Af recognizes 

fca;. 

Note that the group of units is a rational subset of F. We view {/(F) CFC AJ 
and we let U = {a £ Ap | a £ {7(F)}. We use the monoid B = {1,0} in order 
to recognize U U {1}. The recognizing morphism ipjj maps {1} U U to 1 and all 
other letters to 0. Now, consider the monoid 


TV = ({1,0} U A F x Af) x B. 


Define ip : Ap —>■ TV by if)(a) = (ipw(a), ipu{a)). Hence, 


ip(a) 


((a, a), 0) if a £ Ap \ U 
((as, a), 1) if a £ U 


Then ip recognizes U (F) and F simultaneously. This works because 


17(F) = F\AJf- (A F \Z7) • Ajf. 


(14) 


Moreover, if Ap is embedded in any larger alphabet A then, by extending ip 
to a homomorphism ip : A* —> TV by ip{a) = (0,0) for all a £ A \ Ap, we 
obtain that ip recognizes simultaneously all Boolean combinations of the sets 
{1}, {7(F), F, and Ap inside A*. The monoid TV has an efficient representation 
and |TV| < 2(2 + |A f | 2 ). 

In her thesis Michele Benois proved that the family RAT(F) forms an effective 
Boolean algebra. Her statement in [3] is for free groups only, but her proof holds 
for the free product F, too. 

Proposition 2 (Benois, 1969). Let R £ RAT(F) be rational and R = n(L(A)), 
where A = ( Q , Ajjj., S, I , F) is an NFA with n states over the free monoid Ap and 
5 C Q x (Ap U {1}) x Q. Then there exists an NFA A! with n states satisfying 

R = n(L(A')) CL(A'). 
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In particular, RAT(F) forms an effective Boolean algebra. 

Proof. (Sketch) For p,q £ Q let L(jp, q) denote the set of words labeling a path 
in A from p to q. We construct an automaton A' = ( Q, AJ, S', I, F) by defining 
S' as follows. Set S' = S. Repeat the following loop: as long as there are a, b £ Ap 
such that ab ^ 7 r(ab) = c £ A F U {1} with ab £ L(p, q) but (p, c, q) ^ S', replace 
S' by S' U {(p,c,q)}. ^ 

This process takes at most |Q|“ (|Ap| + 1) steps before it terminates; and it 
produces an NFA A! as desired. 

In order to show that RAT(F) forms an effective Boolean algebra, it is enough 
to show that it is effectively closed under complementation. Therefore let A!' be 
an NFA accepting the complement AJ \ L{A'). Since F is a regular subset of AJ, 
the set L(A")nF is regular, hence rational. We have 7r(L(M")nF) = F \R. Thus, 
F \ R £ RAT(F) since F \ R is the homomorphic image of a rational set. □ 

Proposition [2] is crucial: it is the (only) justification for our convention that 
rational constraints X £ R for F are specified by X £ p -1 (to) where p : AJ —>■ N 
is a homomorphism to a finite monoid N and m £ N. Recall that for a given 
morphism a : fl —> AJ the semantics is pcr{X) = m. It is essential to have a one- 
to-one correspondence between RAT(F) and the set {L £ RAT(AJ) | L C F}, 
and this is induced by the homomorphism 7r : AJ —> F. 

3.2 From F to the free monoid AJ with involution. 

EDTOL languages are closed under finite unions. Making non-determinstic guesses 
(which cover all cases) and pushing negations to atomic formulas we may assume 
without restriction that the input <?> is given a conjunction of atomic formulas 
of either type: U = V, U ^ V, X £ p~ l (rn), and X ^ p _1 (m). Recall that 
p : AJ —> N is a homomorphism to a finite monoid. Since we may assume that 
N has at least two elements, we can replace each subformula X ^ p~ 1 (m) by 
X £ p~ 1 {m'), where m ^ m' since, by definition, X £ p _1 (m) refers to an 
evaluation over Ajjj. and not over F. 

Concerning U = V and U ^ V, we use standard triangulation and obtain the 
following equations and inequalities: X ^ Y, X = yz, and X = 1 with X,Y £ 12 
and \x\ = \y\ = 1. Without restriction we can assume that / 0 —1 (1) = {1}, hence 
we can replace X = 1 by the constraint X £ p -1 (l). Thus only equations X = yz 
and inequalities X ^ Y remain. 

Next, we follow a well-known procedure for replacing these equations and 
inequalities over F by equivalent formulas over AJ., using equations and rational 
constraints, only. For an equation X = yz we use the following equivalence, 
which is true for all reduced words x, y, z £ F C AJ.: 

x = 7 r(yz) <*=> 3P, Q, R 3a, b, c £ Af U {1} : R £ U (F)A 

a = 7 r(6c) A x = PaQ A y = PbR A z = RcQ A R £ 17(F). 

For a “visual” proof of this equation see Figure [2j 
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Figure2. Part of the Cayley graph of F: the geodesic triangle to an equation 
x = w(yz). 


For an inequality X ^ Y we use the following equivalence, which is true for 
all words x,y £ Ap (it is here that we use the assumption that F is infinite): 

x , y SF A x ^ y •<=>■ 

3P, Q , R 3a, b, c £ Af : b ^ c A xa = PbQ A ya = PcR A xa £ F A ya £ F. 

Remark 1. We have shown that it is enough to prove Theorem [2] in the special 
case where F = Ap is a free monoid with involution. In particular, all words are 
reduced. The formula ^ is a conjunction of triangular equations X = yz and of 
constraints X £ p -1 (m). The main remaining obstacle is that Ap might contain 
self-involuting reduced words. In particular, there can be a letter a with a = a, 
and even if there is no such letter then still aa is a self-involuting reduced word. 

3.3 How to remove self-involuting letters 

The alphabet Ap is an alphabet with involution, denoted here by ~. As we are in 
the general situation we might have a = a for some letter a £ Ap. Choose some 
subset A+ C Ap such that the size of A+ is minimal while satisfying 

Ap = A+ U {a | a £ A^} . 


By minimality, we have 

A + C {a | a £ A + } = {a £ Ap | a = a} . 

We let A_ be a disjoint copy of A + . We can write A_ = {a | a £ A + } and 
then a = a makes A± = A + U A_ into an alphabet with involution — without 
self-involuting letters. Now we encode words over AJ as words over Alj_ via a 
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morphism i : —>• A±, where t(a) = a and t(a) = a if a ^ o £ A + , and 

i{a) = aa if a = a. Note that i respects the involution. (The morphism i is a 
code because t(Af-) is the basis of a free subnronoid in At j_.) We also define a 
homomorphism 77 : A± —> Ap in the other direction by defining 77(a) and 77(a) 
for a £ A + as follows. We let 77(a) = a and if a / a then 77(a) = a. If, however, 
a = a, then 77(a) = 1 . The homomorphism 77 does not respect the involution, but 
this does no harm. Note also that 77 erases letters. However, the restriction of 77 
to a subset of t(Ajjf) injective. 

We transform in Remark Q] as follows. An equation U = V is replaced 
by i(U) = (.(H), and every variable X receives an additional constraint X £ 
i(Af). Note that t(A£) is a regular subset in A± and we can recognize i(A|) by 
PA f : A* j_ —> Na f , where Na f has size 0 (n 2 ). The precise definition of pA r is 
natural and left to the reader. Previous constraints X £ p~ x (m) are replaced by 
X £ 77 _1 (p _1 (m)). Thus, we have transformed d> over A£U 1 ? into a new formula 
i>' over A± U f 2 . Moreover, the construction guarantees that 77 defines a bijection 
between Sol(<£') and Sol(<£). Thus, if Sol($') = {<^(#) | £ L(_ 4 ')}, then we find 

another NFA A which has just one more state than A', and we obtain: 

Som = 77 (Sol(^) = {77^#) I ip £ L(A')} = H>m | V € HA)} . 

Remark 2 . By the equation above, Remark [U and the last transformations, it 
is now enough to prove Theorem [21 in the special case where F = At j_ is a free 
monoid with involution and <P is a, conjunction of equations and constraints of 
the form A' £ p _1 (?n). Moreover, A± is a set where the involution has no fixed 
points. 

3.4 How to force recognizing homomorphisms to respect the 
involution 

Assume the homomorphism p : AJ. —> N defining the rational constraint in Re- 
mark[l]was in fact a morphism to a finite monoid with involution. This property 
could have been lost during the transformation leading to Remark [ 2 ] 

We now replace a recognizing homomorphism by a recognizing morphism 
to a finite monoid with involution. We show that there is a natural embedding 
of monoids (with or without involution) into monoids with involution. If M is 
any monoid, then we define its dual monoid M T to be based on the same set 
A 1 t = M as a set, where M T is equipped with a new multiplication x oy = yx. 
In order to indicate whether we view an element in the monoid M or M r , we 
use a flag: for x £ M we write x T to indicate the same element in M T . Thus, we 
can suppress the symbol o and we simply write x T y T = {yx) T . The notation is 
intended to mimmick transposition in matrix calculus. Similarly, we frequently 
write 1 instead of 1 T which is true for the identity matrix as well. Now the direct 
product M x AI t becomes a monoid with involution by letting (x, y T ) = (y, x T ). 
Indeed, 

Hi,yT) ■ {x2,vl) = (2/22/1, = (x 2 ,yj) • (xi,yf). 

The following items are essential. 
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— If M is finite then M x M T is finite, too. 

— We can embed M into M x M T by a homomorphism l : M — > M x M T 
defined by i{x) = (x, 1). Note that if ij : MxM T — > M denotes the projection 
onto the first component, then tjl = id m- In particular, every homomorphism 
p : M — > N of monoids factorizes through up : M —»• N x N T . We have 
p = rjip. 

— If M is a monoid with involution and p : M —> N is a homomorphism of 
monoids, then we can lift p uniquely to a morphism p : M —> N x N T of 
monoids with involution such that we have p = rjp. Indeed, it is sufficient 
and necessary to define p(x) = (p(x), p(x) T ). 

Example 2 ($j). Let M = M nxn . Then M x M T = B nxrl x (B nxn f is a sub¬ 
monoid of the set of 2 n x 2n-Boolean matrices: 


{( 


p o 

0 q t 




l } with (o Q t ) = (o P°0 ■ 


In the line above P T and Q T are the transposed matrices. 


Remark 3. It is enough to prove Theorem [2] in the special case where F = A± is 
a free monoid with involution and <P is a conjunction of equations and constraints 
of the form X £ /i -1 (m), where p : A± —> N is a morphism between monoids 
with involution. Moreover, A± is a set where the involution has no fixed points. 


3.5 The initial equation Wi„it 

Recall that, due to Remark[3l we may assume that we are in the special situation 
where F = A± and A± is a set where the involution has no fixed points. The next 
few steps are quite similar to those in the proof of Theorem [Q We introduce a 
special symbol # with # = #, and we let A = A± U {#} be the initial alphabet 
with involution. In this alphabet no other symbol except # is self-involuting. 

All rational constraints are given by a morphism p 0Q : A± — > N where N is 
a finite monoid with involution which has a zero. Without restriction we assume 
that ^oo(l) _1 Q {1} an< I that f° r all w £ A± we have poo{w) ^ 0 w £ 

t(Af), where i is the embedding of Ap into A± from above. We extend poo to 
a morphism po : A* —> N by po(#) = 0. We then extend po to a morphism 
Pinit ■ ( A U 17)* —> N by guessing the values for 0 ^ pinit(X) = pimt(X) £ N. In 
the following ptnit (x) changes frequently for symbols outside of A, thus we use 
p,p' as generic notation. However, the following will hold throughout: po(a) = 
Minit(a) = /z(a) = p'(a) for all a £ A. The starting point is now a system of 
equations Ui = Vi with Ui,Vi £ (A± U 17)* for 1 < * < s and the morphism 
Pinit : (A U 17)* —> N. We don’t care that \UiVi\ = 3 anymore. 

We let U 1 = t/i# • • • #t4 and V’ = Vf# • • • #V a . Therefore, we have only 
one single equation. Next, we define the actual initial equation Winit as in ([5]) . 

W init =#*!#... #X t #U'#V'W! 7 #I 77 #^# • ■ ■ m#- (15) 


28 





In particular, according to © we have A± U £2 = {x \,..., xi} with Xi = X, for 
1 < i < k. The overall strategy to proving Theorem [5] is as before - we show 
that the following language is EDTOL: 


{tr(Xi)# • ■■#a(X k ) | o-(Winit) = cr(Winit) A /q n it = fio<J A VX : a(X) = <j(X)} . 


Recall that for every morphism a : Q —> A± we have 


a(U') = a(V') <=* cr(W init ) = a(W init ). 


We fix some large enough n = n init G 0(\Wi n i t \) and alphabet C of size (D(n). 
We assume that ACC and no other symbol than # £ C is self-involuting. In 
contrast to the above we content ourselves with \C\ C O(n); we leave it to the 
reader to calculate large enough constants. 

As usual we let A C B, B' C C and we assume that B and B' are closed under 
involution. By X , X’ we denote subsets of 12 which are closed under involution, 
too. Moreover, we let S = C U 12. 

The letter y refers to a morphism /i : (BUX)* —> N defined by y : BUX —>■ N 
where N is a finite monoid with involution. We assume that y(a) = yo(a) for 
all a C A. The morphism y encodes the rational constraints: in particular, using 
the appropriate /i we can guarantee that solutions are in c( F). 

3.6 Partial commutation induced by a type relation 

The definition of type relations appearing in Q was more general than necessary. 
For the proof of Theorem [2] we restrict the type relations in order to focus on 
what we need, because we use now compression arcs which have have not been 
used in Theorem [L] The main difference between the proofs of Theorem [T| and 
Theorem [2] is that reduced words may have self-involuting factors of the form 
aa and that compression of aa into a letter would lead to self-involuting letters, 
which we must avoid. The solution is simple: never compress any factor aa. It 
will be enough to compress factors (aa) 1 for £ > 2 down to a self-involuting word 
of length 2. 

We restrict the rather general notion of type relation over S = C U 12 as 
follows. Since it is the restriction of the earlier definition, 6 is an irreflexive 
and antisymmetric relation, where ( x,y ) C 6 implies (x, y) £ 9. In addition, we 
require that first, (x,y) € 6 implies y £ {c,c, cc} for some cCC and that there 
is no (x’,y') G 6 with x G {c,c}*, and second, only the following two forms of 
type relations 9 are allowed, where cG C\ {#} is a letter. 


0 C {(a;, c), (x, c) \ x G X \ {c, c}} . 

9 C {(X, cc) | X G 12} U {(aa, cc) | a £ C \ (c, c}} . 


(16) 

(17) 


Moreover, \9(x)\ < 1 where 6(x) = {y £ C + \ (x,y) G 9}. Clearly, we maintain 
\9\ G O(n), which allows us to store 9 in quasi-linear space. Given 9 and p : 
B U X —>• N such that n(xy) = p(yx) for all (x, y) G 9 we define as above the 
following two partially commutative monoids with involution. 
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1. M(B, X, 9, p) = (B\JX)*/ {xy = yx \ (.x , y) £ 9 }, a monoid with a morphism 
p : M{B,X, 9,p) ->• AT. 

2. M(B,9, p) = B*/ {xy = yx \ (x, y) £ 9}, a submonoid of M(B, X, 9, p) such 
that 

p : M(B, 9, At) M(£, A, (9, /r) -A AT. 

Let us recall that if w < W £ M{B,X,9,p), then w is called a proper factor 
if w ^ 1 and = 0 and that, since the defining relations for M(B, X, 9, p) 
are of the form xy = xy, we can define \W\ and \W\ a for W £ M(B,X,9,p) 
by representing W by some word W £ (B U X)*. It follows that we can decide 
w < W in quasi-linear space for w, W £ M{B, X, 9, p). 

As above, W £ M(B , X, 9, p) is well-formed if it is well-formed according to 
Definition O We repeat the definition of an extended equation and apply it to 
the restricted version of type relation. 

Definition 5. An extended equation is a tuple V = ( W,B,X,9,p ) where W £ 
M(B,X,9,p) is well-formed. A 13-solution of V is a B-morphism a : M(B U 
X, 9, p) —>• M (13, 9, p) such that cr(W) = <j{W) and cr(X) £ c* whenever ( X , c) £ 
9. A solution of V is a pair (a, a) such that a : M{B,9,p) —> A* is an A- 
morphism satisfying pga = p and a is a B-solution. 


3.7 The directed labeled graph Qy. 

Define the graph Gw to be the induced subgraph of Q which is defined by the 
set of all extended equations ( W, B, X ,9, p) where 9 satisfies the specification 
as above. (If necessary, adopt the constant n to be large enough, say k = 100.) 
In particular, 9 is either of the form m or inn. The restriction is imposed in 
order to focus on the essential arcs, and it is allowed to start with Q as defined 
originally. In particular, we keep the set of initial vertices, which are the vertices 
of the form 

(Winit, A, fl, 0, A^init)- 
The set of final vertices is again 

{(W,BMp) | W = W}. 

All arcs in Q that are between vertices of C/f are also arcs in Gw, since we consider 
the induced subgraph. In particular, Proposition |Tj still holds, and states the 
following. 

Proposition 3. Let Vq —b V± ■ ■ ■ Vt be a path in Gw of length t, where 
Vo = (Winiti A, fl, 0, Atinit) is an initial and Vt = {W', B, 0, 0, p) is a final vertex. 
Then Vo has a solution (id^, a) with er(Wi n it) = h\ ■ ■ ■ ht(W'). Moreover, we 
have W' £ ffuiff ■ ■ ■ ffukffB* such that |uj |^ = 0 and we can write: 

hi--- h t (u!# ■ ■ ■ #u fe ) = a(Xi)# ■ ■ ■ #cr(A fc ), (18) 

Proof. See Proposition [T) □ 
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4 General compression 


We can now give the proof of Theorem [2j following the same scheme as for free 
groups and the proof of Theorem [T| 

Consider an initial vertex Vo = (Winit> 4, Q, 0, /q n i t ) with a solution (a, a). 

We show that Qp contains a path Vo —4 Vi • • • V) to some final vertex 
Vt = (W 7 , B, 0,0, /j) such that cr(Wi n it) = h\ ■ ■ ■ h t (W'). We show the existence 
of the path using a repetition of the sequence: 

“block compression”, “non-standard block compression”, “pair compression”. 

Let us recall that the scheme is repeated until we reach a final vertex and that 
the procedures use some external knowledge about solutions. We proceed along 
arcs V V ' in thereby transforming a solution (a, a) to V into a solution 

(a\a r ) to V ' such that we keep the invariant aa(W) = a'ha'(W'). 

4.1 Standard block compression 

The procedure has been described above in the main body of the paper as 
block compression. We start at a non-final vertex V = {W 1 B , X , 0, /i) with a 
solution (a, a) with \B\ < \W\ G 0(n). We move along arcs satisfying the forward 
condition and we arrive at a vertex V ' = (W 7 , B', X 1 , 0, p!) with a solution 
(a!, a'). We have \B\' < |V ; G| / G \W\ + 0(n), so there is a possible increase by 
0(n ) in the length of the equation, but we know that W' does not contain any 
proper factor b 2 with b G B'. At the end of the standard block compression we 
rename V' = (W 7 , B' , X' , 0, //) and (a/, cr') as V = (W, B , df, 0, //) (a, a), but we 
keep in mind the increase of length by O(n). 

After that, we start the non-standard block compression to remove all factors 
aaa from er(kV). This is explained next. 

4.2 Non-standard block compression 

We follow the explanation and notation according to the main body of the paper. 
We consider some non-final vertex V = (W, U, X, 0, p) with an empty type 
relation and a solution (a, a). Let B \ {#} = B + U B_ be any partition such 
that b G B + b G £?_. Recall that aa could be a reduced word. As aa is 

a self-involuting word we cannot compress it into a single letter c because then 
the letter c is forced to be self-involuting, since compression of a factor aa = aa 
must be unambiguous. Note that aa has no non-trivial self-overlap since a 

Let c G C \B and h : C* —> C* be the renaming homomorphism defined by 
h{c) = a and h(c ) = a. Let w G B* be any word. Then there is a unique word 
w' G B* such that w = h(w') and w' does not have any factor aa though it may 
have a factor aa. The word w' can be obtained by replacing every occurrence of 
aa by cc. The word w' is unique because aa has no non-trivial self-overlap. 

Let us introduce the following notation. For a € C, w £ E* = (C U 17)*, and 
A > 1 such that ( aa) x is a factor of w. We say that an occurrence of (ao) A in 
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w is maximal if the occurrence corresponds to a factorization w = u(aa) x v such 
that neither u £ S*aa nor v £ aa£*. This means that at least one occurrence 
of ( aa) x in w is not contained in any occurrence of a factor (aa) A+1 . Note that 
(aa) 1 , {aa) 2 , (aa) 3 may be factors in some w, both (aa) 1 and ( aa ) 2 have maximal 
occurrences, but (aa) 3 does not: for example, w = (aa) 1 #(aa) 2 #(aa) 4 . Note also 
that aa has a maximal occurrence in iu = aaaa. 

We can repeat, partly verbatim, the standard block compression, but there 
are slight modifications, and we divide the procedure into smaller steps. 

Remark 4- Before we describe the procedure in mathematical terms, let us try to 
give a a high level explanation what a non-standard block compression does. The 
basic idea is simple. In order to avoid self-involuting letters we cannot compress 
cc into a single letter, but we can compress cccc into the word cc. This means 
we can compress maximal blocks (cc) 2 * into blocks (cc)*. The compression must 
correspond to morphisms. This is fine: the morphism c t —> cc maps cc to cccc. 
But then we can continue the same way only if i is even. Therefore there is some 
extra work necessary if £ becomes odd. If there is a maximal block (cc)* with £ 
odd then we replace first (cc)* by caCa(cc )* -1 where ca is a fresh letter. Once 
we have CaCa available, we can compress (cc)caCa(cc) into caCa- The morphism 
maps ca to ccca • Thus, caca is mapped to (cc)caca(cc) which is equal to caca(cc ) 2 
due to partial commutation. So at the end every maximal block (cc)* where t 
is even or odd gets compressed into some caca- One could say that caca is a 
special pair with a sort of marker which always guesses correctly whether it sits 
“inside” some block (cc)* with £ = 2 mod 4 or with £ = 0 mod 4. This is why 
counting mod4 comes in. 

begin non-standard block compression 

We begin at vertex V = (IT, B, X, 0, n) with a solution (a, a). During the 
process we introduce partial commutation but it vanishes at the end. After each 
transformation we rename the current vertex as (IT, B , X, 6 , /a) with a solution 
(a, a). During the (aa)-compression we enlarge the alphabet and then the stan¬ 
dard notation for a vertex becomes (IT, B' , X, 9 , /i). 

1. Follow arcs of type 4 and 6 in order to remove all variables with |<r(A)| < 10. 
Thus, without restriction, we have |<r(A)| > 10 for all X. If V became final, 
we are done and we stop. 

2. For each X we now have cr(X) = bw for some b £ B and w £ B + . Following 
a substitution arc 6, we replace A' by bX, A by A b and change /i(A) to 
/i(A) = /i(A) = n(w). Now, if bX < IT and b'X < IT are factors with 
b,b' £ B then ^^6 = 6' due to the previous substitution A >->• bX. 

3. For each a £ B + define sets A a C N which contain those A > 1 such that 

there is a maximal occurrence of (aa) x in cr(IT) where at least one of the a’s 
is visible. Note that we treat a £ B + different from a £ B _. However, this is 
not essential here. If there is a maximal occurrence of ( aa) x in cr(IT) where 
at least one of the a’s is visible then there is another maximal occurrence of 
(aa) A in cr(IT) where at least one of the a’s is visible. We have X^ a eB + 14,1 < 
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\W\. We also let 


X a = {X G X I aX <W A cr(X) 6l5*VaI<ffA <t(A) G amB*} . 

Note that if X G T a for some a G B+ then either the factor aa or the factor 
aa or both have a “crossing” in ct(IT). 

4. For each A a ^ 0 - one after another - run the following subroutine, called 
(aa)-compression. The purpose is to remove all proper factors (aaY with 
a G B + and i > 1 from IT. More precisely, if (aaY is a maximal occurrence of 
that factor in IT then the following (aa)-compression replaces this occurrence 
(aa) 1 by some factor caca. We do not have i = A in general, but clearly 
|(aa) f | > |cacX| = 2. 

end non-standard block compression 


Subroutine (aa)-compression. 

The subroutine is called at a vertex V = (IT, B, X, 0, p) with solution (a, a). 

begin (aa)-compression 

1. Introduce fresh letters c 0 ,cfT with p(c a ) = p(a). In addition, for each A G 
A a introduce fresh letters CA,a, CA,a with p(c\, a ) = p(a). Define h(c\ t a) = 
h(c a ) = a and introduce a type by letting 

Q — {(CA,aCA,a5 C a C a ) | A G . 

Renaming arcs 1 realizes this transformation. We did not touch IT, hence 
IT = h(W), but we enlarged B to some set B' and we introduced partial 
commutation. We abbreviate c = c a , c = c\ = c\ ta , and ca = CA, a - 

2. (Change IT and its solution.) We replace in cr(IT) G B* every maximal 
occurrence of a factor (aa) x with A G A a by (cc) A . This yields a new word 
W' G B' . The transformation can be realized again by a single renaming 
arc defined by h(c) = a and leading to a solution (c/,cr'). 

Various c and c appear in IT' and <j'(W'). Note that every c in <j'(W') is 
followed by some c and every c is preceded by some c. Some of their positions 
are visible. For example, W' may have factors of the form XcY or XccccY 
etc. We rename the vertex and its solution as (IT, B\ X , 9, p) and (a, a). 

3. Consider all X G X a where cr(X) is a factor of some word in (cc)*. (For 
example, <j(X) G c(cc)*c.) Follow substitution arcs 6 such that first, for the 
resulting solution o' we have cr'(X) G (cc) 4m for some m G N and second, 
each occurrence of such X G X a in W' occurs inside an occurrence of ccX. 
As usual, we rename the vertex and its solution as (IT, B', X , 9 , p) and (a, a). 

4. (Typing variables.) Enlarge 9 such that it becomes 

0 = {(caca, cc) | A G Aa} U {(A, cc) | X G X a A a(X) G (cc)*} . 
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5. For all X £ X a where a(X) ^ (cc)* factorize a(X) = uv such that u is a suffix 
(possibly empty) of some word in (cc)* and v £ ccM(B',6, p). Following 
more substitution arcs we can make sure that first, u £ (cc) 4m for some 
rri £ N and second, every occurrence of such X is W occurs as a factor ccX. 
This is due to the definition of X„. 

After renaming we have either <r(X) £ ((cc) 4 )* or cr(X) £ ((cc) 4 )*i>((cc) 4 )* 
with 1 v ccM(B', 6 , p) U M ( B ', 9, p)cc. 

6. Remove all variables with cr(X) = 1 and rename the current vertex as 
(W,B',X,0,p). 

7. Call the following while loop, called A a -compression. Repeat the loop until 
A a = {0}. In the beginning of the loop we have A a ^ 0 and we don’t have 
0 £ A a , but the number 0 sneaks in. Actually we will have that A a = {0} if 
and only if and there are no more factors cc. Note that there at least as many 
subsets {ca,ca} available as there are numbers in A a . During the following 
process we will mark some c\, ~c\ to make sure that we use each set {ca, ca} 
only once. At the beginning all c\ are unmarked. 

begin A a -compression 

As above, we realize each transformation via arcs satisfying the forward prop¬ 
erty. We continue to denote, by default, each current vertex as (W, B ', X, 6 , /z) 
and the current solution as (a, a). 

The first while loop is realized as a path in by renaming arcs with label 
h(c\) = c; the second one uses substitution arcs with label h(c\) = ccc\. 

Let A = A a . 

while A ^ {0} do 

(a) while there is l £ A such that l is odd do 

— let i be largest odd number in A; 

— choose an unmarked letter c\ and mark c\ and ca; 

— replace every maximal occurrence of (ccY by caCa(cc)^ 1 ; 
whenever possible make the factor caca visible; 

(we have caca(cc)^ -1 = (cc)^caca(cc)^ 2 £ M(B',9,p) for all l\ + 

= l — 1, hence Caca can be chosen to be visible for an occurrence 
of (cc) £ unless no position of that occurrence is visible) 

— replace A by (A \ {£}) U {£ — 1}. 
end while 

Note that now, if (cc) £ is a maximal occurrence in a(W) then £ £ A and 
all £ £ A are even. 

(b) while a(W) has a maximal occurrence of some factor c\c\{cc) 1 with 
£ = 2 mod 4 do 

— let £ be largest number in A such that there is a maximal occurrence 
of the factor caca(cc)^ with £ = 2 mod 4; 

— replace every occurrence of ca5a(cc)^ by ca5a(cc) { ~ 2 ; 

realize this transformation via some substitution arc with label h(c\) = 
cccy, note that h[c\c\) = cccacacc = caca(cc) 2 ) £ M(B',9,p) 

— replace A by A U {£ — 2}; 
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— if a(W) does not contain any maximal occurrence of (ccY then re¬ 
place A by A \ {£}. 

end while 

Note that the first two while loops did not use any substitution arc since 
no “uncrossing” was necessary. Moreover, now all i £ A are even and 
if there is a maximal occurrence of some factor caca(cc)^ then we have 
£ = 0 mod 4. Thus dividing such value £ by two keeps this value even. 

(c) Follow a substitution arc with label h(c) = cc in order to replace all 
maximal occurrences of factors (cc) e with £ £ A by (cc)^/ 2 ; 

replace A by the set {A/2 | A € A}; 

(note that A may have odd numbers, again) 

(d) Remove all variables X with |cr(X)| < 10. 

(e) For each of the remaining X £ X a there is a unique factorization cr(X) = 
uv such that u £ (cc)* and v £ (cc)* M(B 1 , 6 , /x) . Using substitution arcs 
we may assume that u £ (cc) 4m for some m £ N. 

end while 

end A a -compression 

8. Let B = B'\ {c,c}. Since A a = {0} after at the end of the A a -compression, 
no c or c appears in <j(W): they are all compressed into single letters c\ 
or c\. Moreover, for x £ B U if we have 8(x) = 0. Hence we can follow an 
alphabet reduction arc ( W., B' , X, 6 , /x) {W, B, X } 0, /x). The new solution 

to (IF, B, X, 0, /j) is the pair (a', a) where a' = ae is defined by the restriction 
of a to the free monoid M(13, 0,/x'). We rename the current vertex and its 
solution as (W, B , X, 0, /x) and (a, a). 

end (aa)-compression 

Having performed one round of a non-standard block compressions, we have 
increased the length of W by at most 0(n). We end up at a vertex named again 
as V = ( W , B, X, 0, fi) which has a solution (a, a). The difference is that W has 
no proper factor aaa with a £ B anymore. 


Repeat: 1. standard block compression, 2. non-standard block 
compression, 3. pair compression until X — 0 

The title of this section explains what we do. We repeat rounds of 1. standard 
block compression, 2. non-standard block compression, 3. pair compression. After 
the non-standard block compression the word W does not contain any proper 
factor of the form a 1 with £ > 2 or aaa where a £ B. Thus, at the beginning of 
pair compression a proper factor u of W of length three looks as u = abc with 
either c {b,b} or a £{M}. 

To see that, indeed, all factors aaa vanish, consider for example the case that 
before (and after) the standard block compression the word W contains a factor 
baaab with a ^ b. After that the non-standard block compression uses renaming. 
It changes this factor either to bccab (if a £ B+) or to baccb (if a £ B + ); and 
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at the end of the non-standard block compression this factor appears either as 
bc{ciab or as bac\C\b. 

The pair compression we proceed according to Section [231 but we never com¬ 
press any pair aa. Now consider again u < W with u = abc and either c ^ { 6, fo} 
or a ^ {6,6}. By symmetry, we may assume a £ {6,6}. Now the probability 
that ab is compressed is Pr [ a £ L A 6 e R] = Pr [a £ L] ■ Pr [6 £ i?] = \. It 
follows that the expected length of that factor u after pair compression is at 
most 3 • | + 2 • j = -j- which is less than 3. Thus, we obtain a recursion of type 
s(l) £ 0(n) and s(i + 1) < q ■ s(i) + 0(n) for all? £ N where q = < 1. Such 

a recursion implies s(i) £ 0(n) for all i £ N. This proves Theorem [2j □ 
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